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1 

METHOD FOR DETECTING AND MODIFYING MISMATCHES AND MUTATIONS 

IN NUCLEIC ACID 

5 Detailed Description of the Invention 

Field of the Invention 

The present invention relates to a method for detecting mismatches in a double- 
stranded nucleic acid, a method for detecting a nucleic acid with mutations, and a method for 
10 separating a double-stranded nucleic acid with mismatches, in which methods MutM protein 
is used. 

Background Art 

E. colt MutS is a protein that recognizes and binds to mismatches (S-S Su et al. v J. 
Biol. Chem., 263, 6829-683S (1988)). Recently, a method for detecting mismatches and a 

15 gene diagnosis method using MutS have been developed (M. Gotoh et aL, Genet. Anal., 14, 
47-50 (1997)). The strength of MutS binding is known to vary in the kind of base mismatch. 
In particular, the binding of MutS to pyrimidine-pyrimidine mismatches is weak (M. Gotoh et 
al„ Genet Anal., 14, 47-50 (1997)). Therefore, it is very likely that mutations would be 
missed if MutS is applied for gene diagnosis. On the other hand, £. coti MutM is a protein 

20 that inhibits transversion mutation from guanine/cytosine to thymine/adenine (M. Cabrera et 
aL, J. Bacterid., 170, 5405-5407 (1988)). E. coli MutM recognizes and removes the oxidized 
base 8-oxoguanine in the context of an 8-oxoguanine/cytosine base pair to suppress 
transversion mutation (M. L. Michaels et aL, Biochemistry, 31, 10964-10968 (1992)). 

25 With respect to cleavage at mismatched base pairs, celery nuclease CEL1 (C. A. 

Oleylowski et aL, Nucleic Acids Res., 26, 4597-4602 (1998)) and T4 phage endonuclease VH 
(R. Youil et aL, Proc. Natl. Acad. Sci. USA, 92, 89-91 (1995)) are known to cleave 
mismatches. Discrimination among different kinds of base pair mismatches is rather low for 
these enzymes, because they cleave almost all mismatches. 

30 

Problems to Be Solved bv the Invention 

Objectives of the present invention are to provide (1) a method for effectively 
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detecting mismatches or mutations in a nucleic acid, (2) a method for effectively separating a 
population of nucleic acid molecules based upon the presence or absence of mismatches and 
(3) a method for effectively modifying a mismatch-bearing nucleic acid, using MutM protein 
in all three methods. Particularly, the method enables effective detection of mismatches 
5 between pyriinidines, which has been difficult to achieve by conventional methods using 
MutS protein. The present invention also provides a method for separating nucleic acids based 
upon the presence or absence of mismatches and a method for modifying a nucleic acid with 
such mismatches. 

In one embodiment, the present invention provides a gene diagnosis method using 
10 the ability of MutM protein to recognize mismatches, a method for purifying a DNA 
amplification product and a method for labeling a mutant nucleic acid. 

Means of Solving the Problems 

The present inventors have tested the ability of £. colt MutM to recognize 

15 mismatches in a DNA and found that MutM protein can recognize all kinds of mismatches 
involving cytosine (Figure 1). Moreover, the present inventors have found that MutM protein 
also binds to not only a single mismatch but also to multiple continuous mismatches (Figure 
2). In addition, the present inventors have found that MutM protein also binds to not only 
single base-single base mismatches but also to single base-multiple base mismatches, in 

20 which multiple bases are inserted into one strand of a double-stranded nucleic acid (Figure 3). 
Moreover, the present inventors have found that MutM protein also binds to mismatches 
generated by deletion or insertion of a single or multiple bases in one strand of a double- 
stranded nucleic acid (Figure 4). 

Furthermore, the present inventors have succeeded in separating a DNA with and 

25 without mismatches from polymerase chain reaction products using £1 coli MutM. In other 
words, the present inventors have found that using MutM protein enables detecting 
mismatches in a DNA and separating a DNA with mismatches* 

Furthermore, the present inventors have found that such ability of MutM protein is 
applicable to gene diagnoses* 

30 Furthermore, the present inventors have succeeded in cleaving a double-stranded 

nucleic acid at mismatches. In other words, the present inventors have found that using MutM 
protein, enables separate a double-stranded nucleic acid from a double-stranded nucleic acid 
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sample and modify a double-stranded nucleic acid at mismatches. 

Thus, the present invention relates to a method for detecting mismatches in a double- 
stranded nucleic acid or a nucleic acid with mutations using MutM protein, and a method for 
separating a double-stranded nucleic acid with mismatches or a nucleic acid with mutations 
5 using MutM protein. More specifically, the present invention relates to: 

(1) a method for detecting mismatches in a double-stranded nucleic acid, wherein the method 
comprises 

(a) contacting a test double-stranded nucleic acid with MutM protein and 

(b) detecting the binding of said double-stranded nucleic acid to said protein; 
10 (2) the method of (1), wherein MutM protein is derived from E. coli; 

(3) the method of (1) or (2), wherein the test double-stranded nucleic acid is binding to a 
support or labeled so as to bind to a support; 

(4) the method of (1 ) or (2), wherein MutM protein is binding to a support or labeled so as to 
bind to a support; 

15 (5) the method of any one of (1), (2), or (4), wherein the test double-stranded nucleic acid is 
detectably labeled; 

(6) the method of any one of (1) to (3), wherein MutM protein is detectably labeled; 

(7) a method for detecting mutations in a nucleic acid, wherein the method comprises 
(a) providing a test nucleic acid and a control nucleic acid, 

20 (b) hybridizing said test nucleic acid with the control nucleic acid, 

(c) contacting the double-stranded nucleic acid formed by hybridization with MutM protein, 
and 

(d) detecting the complex between heteroduplex nucleic acid in said double-stranded nucleic 
add and said protein; 

25 (8) the method of (7), wherein MutM protein is derived from E. coli; 

(9) the method of (7) or (8), wherein the test nucleic acid or the control nucleic acid is binding 
to a support or labeled so as to bind to a support; 

(10) the method of (7) or (8), wherein MutM protein is binding to a support or labeled so as to 
bind to a support; 

30 (11) the method of any one of (7), (8), or (10), wherein the test nucleic acid or the control 
nucleic acid is detectably labeled; 

(12) the method of any one of (7) to (9), wherein MutM protein is detectably labeled; 
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(13) a method for separating a double-stranded nucleic acid with mismatches from a double- 
stranded nucleic acid sample, wherein the method comprises 

(a) contacting the double-stranded nucleic acid sample with MutM protein and 

(b) collecting a double-stranded nucleic acid that forms a complex with MutM protein from 
5 the double-stranded nucleic acid sample; 

(14) a method for separating a double-stranded nucleic acid without mismatches from a 
double-stranded nucleic acid sample, wherein the method comprises 

(a) contacting the double-stranded nucleic acid sample with MutM protein and 

(b) collecting a double-stranded nucleic acid that does not bind to MutM protein from the 
10 double-stianded nucleic acid sample; 

(15) the method of (13) or (14), wherein MutM protein is derived from £. colt, 

(16) the method of any one of (13) to (15), wherein the double-stranded nucleic acid is a DNA 
amplification product; and 

(17) the method of any one of (13) to (16), wherein MutM protein is binding to a support or 
15 labeled so as to bind to a support 

"Mismatch"used herein means that a base pair selected from adenine (A), guanine 
(G) v cytosine (C), and thymine (T) (or uracil (U) in RNA) is not a normal base pair (A/T or 
G/C). In this invention, a "mismatch" includes not only a single mismatch but also multiple, 
continuous mismatches that are generated by inserting or deleting a single or multiple bases or 

20 their combinations. 

"Mutation" used herein means a base (or a base pair in the case of a double-stranded 
nucleic acid) in a sample nucleic acid different from that in a control nucleic acid* Thus 
polymorphisms (specific sequence variants occurring at greater than 1% frequency within 
natural populations) are included in this definition of "mutation". 

26 "A nucleic acid" used herein includes a DNA or an RNA, for example, a cDNA, a 

genomic DNA, an mKNA, or a synthetic oligonucleotide. It also includes a single-stranded 
and a double-stranded nucleic acid, and a linear and a circular nucleic acid* 

**A control nucleic acid" means a reference nucleic acid without mutations. In the 
case that natural polymorphisms exist in the base sequence, one of the most common 

30 sequence variants is chosen as the reference. "A sample nucleic acid" means a nucleic acid 
suspected to have bases different from those in the control nucleic acid (mutations). A sample 
nucleic acid is the same as a control nucleic acid if it does not have mutations. In other words, 



5 

it is different fiom a control nucleic acid only at the mutation sites if it has mutations. For 
example, when mutations in genes of a patient suspected to have a genetic disease are to be 
detected, a gene of the patient suspected to have mutations is a sample nucleic acid and the 
corresponding gene of a healthy subject is a control nucleic acid 
6 M A heteroduplex nucleic acid" means a substantially complementary double-stranded 

nucleic acid but with regions that are not complementary due to one or multiple mismatches* 
"A test nucleic acid*' is formed by melting and re-annealing control and sample nucleic acids 
together so that double-stranded nucleic acid molecules form which are composed one strand 
each from the control and sample nucleic acids. Thus if a sample nucleic acid contains 
10 mutations, a test nucleic acid derived fiom it will include heteroduplex molecules. 



Mode for Implementing the Invention 

In one aspect, the present invention relates to a method for detecting mismatches in a 
double-stranded nucleic acid using MutM protein. The method of the present invention 

15 utilizes the ability of MutM protein to recognize mismatches in a double-stranded nucleic 
acid. In this method, mismatches are detected in terms of binding of MutM protein to a test 
double-stranded nucleic add Therefore, the method of the present invention comprises (a) 
contacting a test double-stranded nucleic acid with MutM protein and (b) detecting the 
binding of said double-stranded nucleic acid to said protein. 

20 The method of the present invention is particularly suitable for detecting mismatch 

base pairs involving cytosine (G/A, C/T t C/C). The method can also be applied to detecting 
multiple continuous mismatches, mismatches between a single base and multiple bases, and 
mismatches generated by deleting and/or inserting a single or multiple bases in at least one 
strand of a double-stranded nucleic acid. In particular, the method is suitable for detecting 

25 mismatches with cytosine. 

MutM protein derived fiom E. coli is preferable as the "MutM protein** used for the 
method of the present invention. However, there is no limitation on its source as far as it can 
recognize mismatches in a double-stranded nucleic acid. To date, yeast Oggl and Ogg2 (P. A 
van der Kemp et ah, Proc. Natl. Acad, Sri, USA, 93, 5197-5202 (1996)), mouse Oggl (T, A. 

30 Rosenquist et al., Proc. Natl. Acad. Sci. USA, 94, 7429-7434 (1997)), Thermus thermophilic 
MutM (T. Mikawa et al., Nucleic Acids Res,, 26, 903-910 (1998)), Arabidopsis thatiana 
AtMMHl and AtMMH2 (T. Ohtsubo et al., Mol. Gen, Genet, 259, 577-590 (1998)), human 
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Oggl (K. Arai el aL, Oncogene, 14 t 2857-2861 (1997)), etc, arc known as homologues to 
MutM proteins as well as to that of E.colL The partial peptides of these proteins can also be 
used as long as they can recognize mismatches in a double-stranded nucleic acid. 

Moreover, proteins (mutants) generated by performing one or more amino acid 
5 substitutions, deletions, additions, and/or insertions into the wild-type proteins can be used as 
long as they can recognize mismatches in a double-stranded nucleic acid. Such mutants can 
occur spontaneously or can be prepared artificially. Many methods for introducing amino 
acid mutations into proteins are well known. For example, the known methods include site- 
directed mutagenesis (W. P. Deng and J, A. Nickoloff, Anal, Biochem., 200, 81 (1992), and K. 

10 L* Makamaya and F. Eckstein, Nucleic Acids Res., 14, 9679-9698 (1986)) and random 
mutagenesis such as the method using E. coli XL 1 -Red strain (Stratagene), which is 
deficient in basic repair systems, and the method in which bases are modified chemically 
using sodium nitrite (J,-J, Diazetal. t BioTechnique, 11,204-211 (1991)). 

MutM protein can be a fusion protein with another protein such as glutathione- S- 

1 5 transferase. 

MutM protein can also be prepared as a wild-type or recombinant protein by a 
known method in which anion exchange column chromatography, cation exchange column 
chromatography, gel filtration column chromatography, ammonium sulfate fractionation, etc. 
are combined (S* Boiteux et ah, EMBO J,, 6, 3177-3183 (1987)). The recombinant protein 

20 can be easily prepared by chromatography alone using a cation exchange column and gel 
filtration column if the expression level is high. 

The double-stranded nucleic acid used in the present invention can be the double- 
stranded nucleic acid to be examined to determine whether or not it has mutations* The 
double-stranded nucleic acid can be in any form of a double-stranded DNA, double-stranded 

25 RNA, or DNA/KNA, The double-stranded nucleic acid can be used in the test as it is or after 
being amplified in a vector such as a phage or plasmid. A double- stranded nucleic acid 
amplified by polymerase chain reaction (PGR) can also be used* 

In the present invention, the test double-stranded nucleic acid can be contacted with 
MutM protein undo- conditions (appropriate pH, solvent, ionic environment, and temperature) 

30 in which said protein can bind to the mismatch region(s) of the test double-stranded nucleic 
add. An example of the buffer composition is SO mM Hepes-KOH (pH 7.2), 100 mM KC1, 1 
mM EDTA, and ImM DTX An example of temperature is 2S°C and that of the reaction time 
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is about 1 to 10 minutes. The conditions mentioned above are only examples; more precise 
conditions of reaction temperature, salt concentration, the kind of ions, and buffer pH can be 
appropriately selected. 

When single-stranded nucleic acids are expected to remain during the preparation of 
5 double-stranded nucleic acids, the single-stranded nucleic acids should be removed by, for 
example, using a MicroSpin S-300 HR column (Amersham Pharmacia Biotech) or blocking 
the nucleic acids with E. coli SSB protein beforehand. 

The method for detecting the binding of a test double-stranded nucleic acid to MutM 
protein is not limited. Examples of the detection system are as follows. 

10 (1) A test double-stranded nucleic acid is immobilized on a support or labeled so as 

to be immobilized on a support, and MutM protein is used without being labeled. The nucleic 
acid can be labeled so as to be immobilized to a support by binding one of the two substances 
with affinity to the nucleic acid and the other to the support. Biotin-avidin system and 
antibody-antigen system (for example, anti^digoxigenin antibody and digoxigenin) may be 

15 used as these substances. MutM protein can be directly detected using a detection system 
with a sensor using, for example, a quartz oscillator, surface plasmon resonance, or porous 
silicon, Examples of the support include the gold on a quartz oscillator, the gold on the 
sensing element of a surface plasmon sensor, or the silicon of a porous silicon sensor 
Substances can be immobilized on the support directly or through a matrix of dextran and so 

20 on (IC Bondeson ct al., FEES Letter, 423, 307-313 (1993)). 

(2) A test double-stranded nucleic acid is immobilized on or labeled so as to be 
immobilized on a support. After MutM protein is reacted with the test double-stranded 
nucleic acid, the MutM protein unbound from the nucleic acid is removed, then the remaining 
MutM protein is detected. The nucleic acid can be labeled with biotin or compounds 

25 recognized by antibodies so as to be immobilized on a support* MutM protein can also be 
labeled with a detectable compound including radioactive substances such as 33 S or 3 H 1 a 
fluorescent substance such as FITC, biotin, or a compound detectable with an antibody such 
as FITC Alternatively, MutM protein can be detected without labeling if an antibody against 
MutM protein is used The complex between an antibody and MutM protein can be detected 

SO by binding another antibody to the complex. An antibody to which alkaline phosphatase, 
horseradish peroxidase (HRP), or p-galactosidase binds can be used. In this case, the 
detection can be performed by a known coloring method or chemiluminescence method in 
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which the activity of alkaline phosphatase, horseradish peroxidase (HRP), or P-galactosidase 
is used. The protein labeled with a radioactive or fluorescence substance can be directly 
detected. Any support that separates solid from liquid, such as membrane filters, microtiter 
plates, chromatographic carriers, or magnetic beads, can be used. When biotin is to be 
5 detected, avidin or streptavidin is immobilized on the support When a compound 
recognizable by an antibody is to be detected, an antibody is immobilized on a support 
Antibodies can be immobilized by physical adsorption or chemical binding using a cross- 
linking agent directly or through protein A or protein G. 

(3) A test double-stranded nucleic acid is detectably labeled and MutM protein is 
10 immobilized on or labeled so as to be immobilized on a support. The nucleic acid can be 

labeled with a radioactive substance such as 35 S or 3 H, a fluorescent substance such as FITC 
or CyS, an enzyme such as HRP, or a compound such as biotin, detectable with avidin or 
streptavidin, or FITC or digoxigenin, detectable with an antibody. MutM protein can be 
immobilized on a support directly by physical adsorption or chemical binding using a cross- 

15 linking agent* Alternatively, the protein can be labeled with biotin or a compound detectable 
with an antibody (antigen) and immobilized on a support through avidin or streptavidin for 
biotin or through an antibody for the antigen. Furthermore, the N-terminus or C-tenninus of 
MutM protein is tagged with a short histidine residue and immobilized through metal by 
utilizing the chelate action of the tag. The detection system and the support used are the same 

20 as those mentioned in (2)* 

(4) A test double-stranded nucleic acid is detectably labeled without labeling the 
MutM protein. The solid and liquid are separated, such as by immunoprecipitation, using 
anti-MutM antibody. The antibody against MutM protein can be immobilized through protein 
A or protein G bound to a support such as beads. A nucleic acid can be labeled as described 

25 in (3). The detection system and the support used are the same as those mentioned in (2). 

(5) A test double-stranded nucleic acid is not labeled, and MutM protein is 
immobilized on, or labeled so as to be immobilized on, a support* The labeling substance that 
can be immobilized on a support is the same as described in (3)* A double-stranded nucleic 
acid can be directly detected in the same manner as in (1), for example, with a sensor using a 

30 quartz oscillator, surface plasmon resonance, or porous silicon. The support is the same as 
described in (1). The protein can be immobilized on the sensing element of a surface plasmon 
resonance sensor by the method of L Chaiken (Anal. Biochem., 201, 197-210 (1992)). 
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If significant binding of the test double-stranded nucleic acid to MutM proteins is 
detected, it is judged that the test double-stranded nucleic acid has mismatches. In contrast, if 
significant binding of the double-stranded nucleic acid to MutM proteins is not detected, it is 
judged that the test double-stranded nucleic acid does not have mismatches, 
5 The method of the present invention for detecting mismatches includes quantification 

of mismatches. For example, it is possible to determine the amount of mismatches in a 
nucleic acid sample by measuring the amount of MutM protein bound to a nucleic acid with 
mismatches using a labeled protein or an antibody. It is also possible to determine the 
proportion of mismatched DNA in a sample by labeling the whole nucleic acid sample, 

10 binding MutM protein to the sample, and measuring the proportion of the nucleic acid that has 
formed a complex to the whole nucleic acid sample. 

In another aspect, the present invention relates to a method for detecting mutations in 
a nucleic acid using the MutM protein. In one embodiment, this detection method can be 
used to examine whether or not a specific gene of a patient suspected to have genetic disease 

j5 has mutations by examining whether or not the gene derived from the patient and the gene of 
a healthy subject have the same nucleotide sequence. The method of the present invention 
enables detecting mutations wherever they exist in a test gene. The method is also superior in 
that the mutation site of a test gene or the kind of mutation need not to be known. 

The principle of this detection method is as follows. A test nucleic acid suspected to 

20 have mutations and a control nucleic acid (a nucleic acid without mutations) are prepared and 
hybridized with each other. If the test nucleic acid has mutations, a heteroduplex nucleic acid 
(a nucleic acid with mismatch(es)) is generated by hybridizing with the control nucleic acid. 
In contrast, if Che test nucleic acid does not have mutations, only a homoduplex nucleic acid is 
generated, and no heteroduplex nucleic acid is generated. If the MutM protein is contacted 

25 with the double-stranded nucleic acid formed by hybridization, Che MutM protein binds to the 
heteroduplex nucleic acid, which has mismatches, but does not bind to the homoduplex 
nucleic acid. Therefore, it is possible to judge whether or not a test nucleic acid has mutations 
by detecting the binding of the MutM protein to the double-stranded nucleic acid. 

More specifically, the detection method of the present invention comprises 

30 (&) providing a test nucleic acid and a control nucleic acid, 

(b) hybridizing the test nucleic acid with the control nucleic acid, 

(c) contacting the double-stranded nucleic acid formed by hybridization with an MutM 
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protein* and 

(d) detecting the complex between the heterodttplex nucleic acid in the double-stranded 
nucleic acid and the protein. 

There is no limitation on the test nucleic acid used. Any desired nucleic acid to be 
5 examined to determine if it has mutations can be used. A control nucleic acid corresponds to 
a test nucleic acid In other words, if the test nucleic acid does not have mutations, it is the 
same nucleic acid as the control nucleic acid. The word "the same" means that both are the 
same within the region where they hybridize with each other. The length can be different but 
should be the same if possible* A test nucleic acid and a control nucleic acid can be either 

10 single-stranded or double-stranded. When they are single-stranded, they are complementary 
to each other provided that the test nucleic acid does not have mutations. 

In the method of the present invention, a test nucleic acid and a control nucleic acid 
are hybridized. (If they are double-stranded, both are denatured and dissociated to form 
single-stranded nucleic acids and the single-stranded nucleic acids are hybridized.) By doing 

15 so, double-stranded nucleic acids are formed (heteroduplex and homoduplex nucleic acids are 
formed if the test nucleic acid has mutations, and only homoduplex nucleic acids are formed 
if the test nucleic acid does not have mutations). 

The double-stranded nucleic acids can be denatured by exposing them to acidic or 
alkaline pH or to a high temperature in a solution. The pH can be changed by replacing the 

20 solution with, for example, 0. 1 M NaOH or 0. 1 M HCL The temperature can be raised to the 
melting temperature (Tin) of the nucleic acid or higher It is usually set to about 95°C. 

Hybridization can be easily performed by returning the pH of the solution to neutral 
or lowering the temperature gradually to set it the same as or lower than Tin. For example, 
for a 200 base-pair nucleic acid, the same mole of a test nucleic acid and a control nucleic 

25 acid, or excess mole of the one to the other, is added in 6xSSC solution (90 mM sodium 
citrate (pH 7.2) and 0.9 M NaCl). The temperature is raised to 95°C once then gradually 
cooled to room temperature over 30 minutes to 2 hours. The mixture is then treated with 
MicroSpin S-300 HR column (Ameisham Pharmacia Biotech) if the single-stranded nucleic 
acid that does not hybridize is removed 

30 Next, the double-stranded nucleic acid formed by hybridization is contacted with the 

MutM protein. The MutM protein used is the same as those described in the method of 
detecting mismatches mentioned above. The binding of the double-stranded nucleic acid to 
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the MutM protein is then detected in the same manner as in detecting the binding of a test 
double-stranded nucleic acid to the MutM protein mentioned above. 

Examples of the detection system are the same as mentioned above. However, when 
a nucleic acid iB immobilized on or labeled so as to be immobilized on a support, either a test 
5 nucleic acid or a control nucleic acid can be immobilized or labeled. 

If significant binding of a double-stranded nucleic acid formed by hybridization to 
MutM proteins is detected, it iB judged that a test double-stranded nucleic acid has 
mismatches. In contrast, if significant binding of a double-stranded nucleic acid to MutM 
proteins is not detected, it is judged that a test double-stranded nucleic acid does not have 
10 mismatches. 

hi another aspect, the present invention relates to a method for separating a double- 
stranded nucleic acid with or without mismatches from a double-stranded nucleic acid sample 
using MutM. The principle of the method of the present invention is as follows. First, a 
double- stranded nucleic acid sample that is expected to contain a heteroduplex nucleic acid is 

15 prepared, and the MutM protein is contacted with this sample. Since the MutM protein binds 
only to a heteroduplex nucleic acid in the double-stranded nucleic acid sample, the 
heteroduplex nucleic acid binding to the MutM protein is collected if the MutM protein is 
collected from the double-stranded nucleic acid sample contacted with MutM. A homoduplex 
nucleic acid can be collected by removing the MutM protein and double-stranded nucleic 

20 acids binding to the protein from the double-stranded nucleic acid sample* 

More specifically, the method of the present invention for separating a double- 
stranded nucleic acid with mismatches comprises 

(a) contacting a double-stranded nucleic acid sample with the MutM protein and 

(b) collecting a double- stranded nucleic acid that forms a complex with the MutM protein 
25 from the double-stranded nucleic acid sample. 

The method of the present invention for separating a double-stranded nucleic acid without 
mismatches comprises 

(a) contacting the double- stranded nucleic acid sample with the MutM protein and 

(b) collecting a double-stranded nucleic acid that does not bind to the MutM protein from the 
30 double-stranded nucleic acid sample. 

In the methods mentioned above, the purity can be increased, if necessary, by 
repeating processes (a) and (b) several times. 
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The method for detecting the cleavage of a test double-stranded nucleic acid by 
MutM is not limited. Examples of the detection system are as follows. 

(1) A test double-stranded nucleic acid, which is labeled with fluorescent dye or radioisotope 
on one or both strands, is cleaved by MutM protein. The cleaved nucleic acid is subjected to 

5 electrophoresis through a matrix under denaturing conditions so that the individual strands 
separate according to their lengths, i.e. as in a DNA sequencer. If the amount of nucleic acid 
is sufficient, non-labeled nucleic acids can be used for electrophoresis and then detected by 
staining with silver or ethidium bromide* 

(2) A test double-stranded nucleic acid is immobilized on a support or labeled so as to be 
10 immobilized on a support, and MutM protein is used to cleaving mismatches in the nucleic 

acid. Then the temperature of the sample is increased slowly to denature the double-stranded 
nucleic acid and cause the non-immobilized strand to etute. The melting temperature of short 
double-stranded nucleic acid such as a cleaved nucleic acid is lower than that of a longer, non- 
cleaved nucleic acid. In other words, cleaved strands of a double-stranded nucleic acid with 

15 mismatches elute first, while non-cleaved strands without mismatches elute later, effectively 
separating these nucleic acids. To increase the sensitivity, fluorescent substances such as 
FTTC or radioactive substances such as 33 S or M P can be used. Alternatively, an organic 
solvent such as fonnamide is available to denature the double-stranded nucleic acid 

The method for separating the double-stranded nucleic acid without mismatches 

20 from a test double-stranded nucleic acid by MutM protein is not limited. An example of the 
separation method is as follows. 

A test double-stranded nucleic acid is treated with MutM protein to introduce single 
nucleotide gaps into one or the other strand at mismatches, and then a nuclease such as SI 
nuclease is used to cleave the intact strand opposite such introduced gaps. Thus by the 

25 sequential action of these two enzymes, the sites of mismatches are rendered into complete 
double-stranded cleavages. Usbg the sequentially treated double-stranded nucleic acid as a 
template for polymerase chain reaction, only the non-cleaved sequence is amplified. 

The method for modifying the double-stranded nucleic acid with mismatches by 
MutM protein is not limited. An example of the modifying method is as follows. 

30 A double-stranded nucleic acid is treated with MutM protein. Ordinarily, the 

cleavage occurs on one strand at the mismatch. Then doing a nick translation reaction with 
DNA polymerase I using appropriately labeled nucleoside triphosphates, radioactive 
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substances such as 32P, fluorescent substances such as Cy5 or affinity labels such as biotin 
can be incorporated into the double-stranded nucleic acid at and adjoining the sites initially 
occupied by mismatches. 

These methods are useful for cloning various genes. The method for separating a 
5 double-stranded nucleic acid with mismatches can be used, for example, to collect single 
nucleotide polymorphism (SNP). Recently, human genome analysis has been making 
progress, and various SNPs have been collected all over the world to clarify the relationship 
between gene mutations and diseases. The method of the present invention is suitable for 
SNP collection because it enables collecting only mismatched nucleic acids. The method for 

10 separating a double-stranded nucleic acid with mismatches is also useful for cloning genes 
homologous with a certain gene. For example, when a homologue of a known gene derived 
from some organism is to be cloned from a different organism, this method makes it possible 
to select a gene whose sequence is similar to some extent but is partly different Furthermore, 
a gene whose sequence is similar but not completely the same can be selectively cloned even 

15 from the same organism* 

The method for separating a double-stranded nucleic acid without mismatches is 
useful for purifying DNA amplification products. When a gene is cloned, DNA fragments 
amplified by polymerase chain reaction (PGR) are often inserted into a cloning vector. In this 
case, mutations are sometimes introduced into PCR-amplified products by DNA polymerase. 

20 The method of the present invention enables removing the DNA amplification products with 
mutations. 

More specifically separation can be performed by for example, reacting the MutM 
protein with a double-stranded nucleic acid in a solution, reacting the resulting reaction 
mixture with an anti-MutM antibody, contacting the reaction products with protein A or 

25 protein G bound to a support such as beads, precipitating the immune complex by 
centrifugation to separate the double-stranded nucleic acid with or without mismatches to the 
supernatant or precipitate, respectively, and recovering than. The double-stranded nucleic 
acid with mismatches can be purified from the precipitate by suspending the precipitate in TE 
buffer (10 mM Tris-HCl (pH 8,0), 1 mM EDTA), adding three volumes of 3 M guanidine 

30 hydrochloride to denature the MutM protein and liberate the double-stranded nucleic acid. 
After centrifugation, the supernatant is collected as the DNA solution. The DNA contained in 
the solution can be used for cloning. 
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Furthermore, the separation can be performed using MutM protein immobilized on 
or labeled so as to be immobilized on a support There is no limitation on die labeling 
substance or the support to be used* The supports include liquid chromatography carriers 
such as HiTrap NHS-activated (Ameraham Pharmacia Biotech), magnetic beads such as 
5 Dynabeads M-450 Uncoated or M450-Tosylated (Dynal), and sensing elements of every kind 
of sensor such as sensor chip CMS (Biacorc) of surface plasmon resonance sensor. The 
MutM protein can be bound physically to Dynabeads M-450 Uncoated and chemically to the 
other supports. In HiTrap NHS-activated, die carboxy) groups of the Sepharose carrier are 
esterified with N-hydroxysuccinimide (NHS). It after the base is substituted with low-pH 

10 solution such as 1 mM HC1, the MutM protein solution is contacted with the carrier, stable 
amide bonds are formed between the amino acids of the MutM protein and the carrier. 
Therefore, when the MutM protein reacted with double-stranded nucleic acid is contacted 
with the carrier, only double-stranded nucleic acid with mismatches can be trapped by the 
carrier, the double-stranded nucleic acid without mismatches can be separated and collected. 

16 The double-stranded nucleic acid with mismatches bound to the MutM protein can be 
collected by denaturing the MutM protein with 3 M guanidine hydrochloride. 

The invention will now be further described with reference to the following non- 
limiting Examples. 

20 

Example 1 

Double-stranded DNAs forty base pairs in length were formed with and without C 
containing mismatches by annealing oligonucleotide No. 15 with individually with 
25 oligonucleotides No. 16, No. 17, No. 18 or No. 1 9. Their sequences were as follows: 

No. 1 S: tgctgagctaatcgadftgtcaagtcatgcgatacgtaac, 

No. 16: Cy5-gttacgtatcgcatgacttgac^gtcgattagctcagca, 

No. 17: Cy5-gttacgtatcgcatgacttgaca^cgaitagctcagca, 

30 No. 18: CyS-gttacgtatcgcatgacttgacaggtcgattagctcagca, 

No. 19: Cy5-gttacgtategcatgacttgacajgtcgattagctcagca 
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The double stranded DNAs were purified with MicroSpin S-200 HR columns 
(Amersham Pharmacia Biotech) and the concentrations of the purified DNAs were measured 
Double-stranded DNAs (2—16 nM final concentration) and MutM protein (2 pM final 
concentration) woe mixed in KC1 buffer and incubated at 37° C for 1 to 3 hours. After the 
5 reaction, the cleavage activity was determined with an automated DNA sequencer by analysis 
of the DNA fragment length. Figure 5 shows results of the fragment analysis. It was found 
that C/C and C/T mismatches are substrates for cleavage by MutM. 

Example 2 

10 The binding between a mismatched DNA and E. coli MutM was analyzed with an 

affinity sensor manufactured by BIACORE. The synthetic biotinylated oligonucleotide (B- 
gttggagcangtggtgttgg, SEQ ID NO: 1; B represents biotin, and n represents A, G, C, or T) was 
fixed on the sensor tip SA (BIACORE) at about 1,300 RU (1 RU corresponds to about 1 
pg/mm 2 substance density) and annealed to the second synthetic oligonucleotide 

15 (ccaacaccacntgctccaac: SEQ ID NO: 2) in 6 x SSC (90 mM sodium citrate (pH 7.2), 0.9 M 
sodium chloride). In this process, about 1,200 RU oligonucleotide was annealed* The 
remaining single-stranded oligonucleotides were blocked by washing with 90 g/ml SSB (a 
single-stranded DNA binding protein). About 400 nM purified MutM was added, monitoring 
the binding to each double-stranded oligonucleotide, KC1 buffer (50 mM Hepes-KOH (pH 

20 7.2), 100 mM KC1, 1 mM EDTA, ImM DTT, and 5 mM MgCl 2 ) was used as a running buffer. 
All manipulations were performed act 25 °C. 

The result is shown in Fig. 1. A) shows interaction of MutM with the mismatches 
between A and A, C or G, and the A/T complementary double strands: B), interaction of 
MutM with the mismatches between C and A, C or T, and the C/G complementary double 

25 strands: C), interaction of MutM with the mismatches between G and A, G or T, and the G/C 
complementary double strands; and D), interaction of MutM with the mismatches between T 
and C, G or T, and die T/A complementary double strands. MutM was contacted with the 
double-stranded oligonucleotides for 25 sec to 1 min, and switched to KCl buffer from 85 sec 
to wash out non-specific absorbed substances. As is clear from Fig. 1, MutM was srongly 

30 bound to C/C, C/T and T/C, and C/A and A/C, while it was not bound to any complementary 
double-stranded oligonucleotides, such as A/T, C/G, G/C, and T/A. 
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Example 3 

E. coli MutM was used to purify DNA amplification products. Two kinds of PCR 
primers (GTAGTTGAAQAAn^CTGAATGAGCCAITTATC, SEQ ID NO: 3; the EcoBl 
restriction site is underlined, and AGCGC CTGCAGC GGGGTGAGTGAATCCGGAT, SEQ 
5 ID NO: 4; the Pstl restriction site is underlined) (25 pmol each) were added to the five PCR 
Beads (Amersham Pharmacia Biotech), and the total volume was made to 25 (ji with 
sterilized water. One platinum loopful of £. coli was suspended in this solution and PCR was 
conducted with 30 cycles at an annealing temperature of 55°C The DNA fragment of 870 bp 
was amplified in this reaction* After the reaction, 25 p 1 of TE buffer (10 wM Tris-HCl (pH 

10 8.0), 1 mM EDTA) was added to the reaction mixture and the unreacted primers woe 
removed using Micraspin column (MicroSpin S-400 HR Amersham Pharmacia Biotech). 
Twenty-one microliters of 20 X SSC buffer were added to 50 pi of the purified PCR product 
solution, and annealing was performed by heating to 95°C and then gradually cooling. The 
unannealed single-stranded DNA was removed by the Microspin column, the same volume of 

15 2 X KC1 buffer was added to the reaction mixture, and the all the reaction mixtures in the five 
beads were gathered to pass through a MutM coupled affinity column described below. 

One milliliter of 100 pM MutM solution was coupled to HiTrap NHS-activated 
column (Amersham Pharmacia Biotech) following the protocol. After blocking, the column 
was equilibrated with KC1 buffer The DNA solution was passed through the MutM coupled 

20 column. The fraction that passed through the colomun was collected and concentrated to 10 
pi by ethanol precipitation, To the resulting concentrate were added 3 pi of OPA buffer 
accompanied with restriction enzymes (Amersham Pharmacia Biotech), 10 units of restriction 
enzyme EcoRl, and 10 units of Pstl. The total volume was made to 20 pi with sterilized 
water* The resulting reaction mixture was incubated at 37°C for 2 hours, mixed with 100 ng 

25 of E coli vector pTrc99A (Amersham Pharmacia Biotech) that were digested with EcoRl and 
Pst\ and precipitated with ethanol to obtain a final volume of 10 pi. The PCR fragment was 
ligated to the vector through the ligation reaction, and the ligation product was introduced into 
E. coli XL 1 -Blue, The DNA insert in the vector contained in 17 transfoxmants obtained was 
amplified by colony PCR using the same PCR primers used in the first reaction* 

30 The efficiency of mutagenesis in the DNA insert was evaluated by analyzing about 

300 bp nucleotide sequence with a sequencing primer (CyS- 
ACCGGTCCGAACATGGGCGGTAAA, SEQ ID NO: 5). As a result, the 17 clones that 
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were subjected to the nucleotide sequence analysis did not contain any mutated DNA insert* 



Exapiple 4 

MutM was used to clone genes with mutation. One milliliter of 3M guanidine 
5 hydrochloride was passed through the MutM coupled column, to which the DNA was applied 
in Example 2, to recover the bound DNA. The collected DNA fraction was concentrated to 
10 pi by ethanol precipitation and used to transform the DNA vector pTYc99A in the same 
manner as in Example 2, The nucleotide sequence analysis of the DNA insert in the vectors 
contained in three transform ants obtained detected the mutation from G to C in one clone. 

10 

The binding between a DNA having continuous mismatches with different length 
and E. coli MutM was analyzed with an affinity sensor The synthetic biotinylated 
oligonucleotides (B-tggtggttggagcaggtggtgttgggaaaa, SEQ ID NO: 6; B- 

15 tggtggttggagcacgtggtgttgggaaaa, SEQ ID NO: 7; B-tggtggttggagcaccgtggtgttgggaaaa, SEQ ID 
NO: 8; B-tggtggttggagcac££gtggtgttgggaaaa, SEQ ID NO: 9; and B- 
tggtggttggagcaccccgtggtgttgggaaaa, SEQ ID NO: 10; B represents biotin, and the mismatches 
are underlined) were fixed on the sensor tip SA (BIACORE) at about 700 RU and annealed to 
the second synthetic oligonucleotides (ttttcccaacaccacctgctccaaccacca, SEQ ID NO: 11 for 

20 SEQ ID NO: 6 and SEQ ID NO: 7, ttttcccaacaccaccctgctccaaccacca, SEQ ID NO: 12 for SEQ 
ID NO: 8 f ttttcccaacaccacccctgctccaaccacca, SEQ ID NO, 13 for SEQ ID NO: 9, 
ttttcccaacaccaccccctgctccaaccacca, SEQ ID NO. 14 for SEQ ID NO: 10; the mismatches are 
underlined) were annealed in 6 x SSC. The running buffer was replaced by KCI buffer, and 
200 nM purified MutM was applied to the column to monitor the binding to each double- 

25 stranded oligonucleotide in the same manner as in Example 1. All manipulations were 
conducted at 25°C. 

The result was shown in Fig. 2, The longer the continuous mismatch was, the less 
the amount of the bound MutM was. However, MutM had the obviously higher binding 
activity compared with the complementary double strands used as a control. 

30 

gxaiqpfc 0 

The binding between a DNA having C/C mismatches and several nucleotide bases of 
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C in one strand, and E. coli MutM was analyzed with an affinity sensor. The synthetic 
biotinylated oligonucleotide (SEQ ID NO: 7) was fixed on the sensor tip SA at about 700 RU 
and annealed to SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, or SEQ ID NO: 14 in the 
same condition as in Example 4, monitoring the binding of MutM to the double-stranded 
5 oligonucleotides in the same manner as in Example 4. 

The result is shown in Fig. 3. The amount of the bound MutM decreased as the 
number of inserted C increased. However, the binding at 400 RU or higher was observed even 
at 250 sec, demonstrating that such mutation can be detected by MutM. 



10 Example 7 

The binding between an insertion or deletion mutant DNA and E. coli MutM was 
analyzed with an affinity sensor. The synthetic biotinylated oligonucleotide (SEQ ID NO: 6) 
was fixed on the sensor tip SA at about 700 RU and annealed to SEQ ID NO: 11, SEQ ID 
NO: 12, SEQ ID NO: 13, or SEQ ID NO: 14 in the same condition as in Example 4, 

15 monitoring the binding of MutM to the double-stranded oligonucletides in the same manner 
as in Example 4. 

The result is shown in Fig. 4. The binding amount of MutM decreased as the number 
of the inserted nucleotide bases decreased. However, MutM exhibited higher binding activity 
than the control complementary DNA in any cases. 

20 

Effect of the Invention 
The present invention has revealed that MutM protein is capable of recognizing any 
mismatches involving cytosine in a nucleic acid. Efficient detection of mismatches in a 
double-stranded nucleic acid is enabled using this characteristic of the MutM protein. It has 
25 also become possible to efficiently separate a double-stranded DNA with mismatches from 
DNA without mismatches. 

The methods of the present invention enable efficiently detecting mismatches 
between pyrimidines, which was difficult to be detected by the conventional method using 
MutS protein and separating a nucleic acid having such mismatches. Moreover, the present 
30 invention can be suitably applied to detection of multiple continuous mismatches, a mismatch 
between a single base and multiple bases, and further, the mismatches generated by deleting 
or inserting one or more bases in one strand of a double-stranded nucleic acid. The methods 
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of the present invention are widely applicable to, for example, gene diagnosis or purification 
of DNA amplification products. 

Brief Description of the Invention 
5 Figure 1 shows the oligonucleotide sequences used for evaluating the binding 

characteristic of MutM, and the results. In each panel, the oligonucleotide used was shown in 
the left, and the upper sequence in the double strand show the fixed oligonucleotide and the 
lower sequence, the annealed sequence* "N" in the sequences represents any one of A, C, G 
or T. A) shows the binding to the double-stranded oligonucleotides having mismatches 

10 between A and N or complementary double-stranded oligonucleotides; B) v the binding to the 
double-stranded oligonucleotides having mismatches between C and N or complementary 
double-stranded oligonucleotides; C) t the binding to the double- stranded oligonucleotides 
having mismatches between G and N or complementary double-stranded oligonucleotides; 
and D), the binding to the double-stranded oligonucleotides having mismatches between T 

1 5 and N or complementary double-stranded oligonucleotides. 

Figure 2 shows the result of analyzing the binding of the MutM protein to the DNA 
having continuous mismatches with different length, i.e., the DNA comprising 1 to 4 C/C 
continuous mismatches (C/C, CC/CC, CCC/CCC, and CCCC/CCCQ, and to the DNA 
without mismatches (G/C). 

20 Figure 3 shows the result of analyzing the binding between the MutM protein and the 

DNA with C/C mismatches and several bases of C at one strand. 

Figure 4 shows the result of analyzing the binding between the insertion or deletion 
mutant DNA and the MutM protein. 

25 Fig. 1 
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30 Relative response 
Time (sec) 
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Fig. 3 
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Claims 
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L A method for detecting mismatches in a double-stranded nucleic acid* wherein the 
method comprises 

5 (a) contacting a test double-stranded nucleic acid with MutM protein and 
(b) detecting the binding of said double-stranded nucleic acid to said protein. 

2. The method of claim 1, wherein MutM protein is derived from £ colt 

3, The method of claim 1 or 2, wherein the test double-stranded nucleic acid is binding to a 
support or labeled so as to bind to a support. 

10 4. The method of claim 1 or 2, wherein MutM protein is binding to a support or labeled so 
as to bind to a support. 

5. The method of any one of claims 1, 2, or 4, wherein the test double-stranded nucleic acid 
is detectably labeled. 

6, The method of any one of claims 1 to 3, wherein MutM protein is detectably labeled. 
15 1. A method for detecting mutations in a nucleic acid, wherein the method comprises 

(a) providing a test nucleic acid and a control nucleic acid* 

(b) hybridizing said test nucleic acid with the control nucleic acid, 

(c) contacting the double-stranded nucleic acid formed by hybridization with MutM protein, 
and 

20 (d) detecting the complex between heteroduplex nucleic acid in said double-stranded nucleic 
acid and said protein. 

8. The method of claim 7, wherein MutM protein is derived from E. colL 
9. The method of claim 7 or 8, wherein the test nucleic acid or the control nucleic acid is 
binding to a support or labeled so as to bind to a support. 
25 1 0, The method of claim 7 or 8, wherein MutM protein is binding to a support or labeled so 
as to bind to a support. 

1 1 . The method of any one of claims 7, 8, or 10, wherein the test nucleic acid or the control 
nucleic acid is detectably labeled. 

12. The method of any one of claims 7 to 9, wherein MutM protein is detectably labeled. 
30 13. A method for separating a double-stranded nucleic acid with mismatches from a 

double-stranded nucleic acid sample, wherein the method comprises 

(a) contacting the double-stranded nucleic acid sample with MutM protein and 
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(b) collecting a double-stranded nucleic acid that forms a complex with MutM protein from 
the double-stranded nucleic acid sample. 

14. A method for separating a double-stranded nucleic acid without mismatches from a 
double-stranded nucleic acid sample, wherein the method comprises 

5 (a) contacting the double-stranded nucleic acid sample with MutM protein and 

(b) collecting a double-stranded nucleic acid that does not bind to MutM protein from the 
double-stranded nucleic acid sample. 

15. The method of claim 1 3 or 14, wherein MutM protein is derived from K colt 

16. The method of any one of claims 13 to 15, wherein the double-stranded nucleic acid is 
10 a DNA amplification product. 

17. The method of any one of claims 13 to 16, wherein MutM protein is binding to a 
support or labeled so as to bind to a support* 

1 8. A method for detecting mismatches in a double stranded nucleic acid, wherein the method 
comprises 

15 (a) contacting a test double-stranded nucleic acid with MutM protein and 

(b) detecting the cleaving of said double-stranded nucleic acid by said protein. 

19. A method for separating a double-stranded nucleic acid without mismatches from a 
double-stranded nucleic acid sample, wherein the method comprises 

(a) contacting the double-stranded nucleic acid sample with MutM protein and 
20 (b) collecting a double-stranded nucleic acid that is not cleaved by MutM protein from the 
double-stranded sample. 

20. A method for modifying a mismatch-containing double-stranded nucleic acid, wherein 
the method comprises 

(a) containing the double-stranded nucleic acid sample with MutM protein and 
25 (b) modifying further a double-stranded nucleic acid that is cleaved by MutM protein in the 
double-stranded sample. 
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Fig. 1 
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CAACCTCGTACACCACAACC 



■GTTGGAGCANGTGGTGTTGG 
CAACCTCGTCCACCACAACC 



■GTTGGAGCAtiGTGGTGTTGG 
CAACCTCGTICACCACAACC 
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Fig. 5 

Mismatch cleavage by MutM 
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